Learning Visual Symbols for Parsing Human Poses in Images

Authors

  • Fang Wang
  • Yi Li
Abstract

Parsing human poses in images is fundamental to extracting critical visual information for artificially intelligent agents. Our goal is to learn self-contained body part representations from images, which we call visual symbols, together with their symbol-wise geometric contexts in this parsing process. Each symbol is learned individually by categorizing visual features with the aid of geometric information. For the categorization, we use a Latent Support Vector Machine followed by an efficient cross-validation procedure. These symbols then naturally define the geometric contexts of body parts at a fine granularity. When the structure of the compositional parts is a tree, we derive an efficient approach to estimating human poses in images. Experiments on two large datasets suggest that our approach outperforms state-of-the-art methods.
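
The abstract does not spell out the inference procedure, so the following is only a minimal sketch of why a tree structure makes pose estimation efficient: generic max-sum dynamic programming over a tree of parts, not the authors' actual method. The function name `tree_max_sum`, the `children`, `unary`, and `pairwise` inputs, and all scores are hypothetical and chosen for illustration.

```python
from typing import Dict, List, Tuple


def tree_max_sum(
    children: Dict[int, List[int]],
    root: int,
    unary: Dict[int, List[float]],
    pairwise: Dict[Tuple[int, int], List[List[float]]],
) -> Tuple[float, Dict[int, int]]:
    """Exact best-scoring assignment on a tree via max-sum dynamic programming.

    unary[p][s] scores part p in state s (hypothetical appearance score).
    pairwise[(p, c)][s][t] scores parent p in state s with child c in state t
    (hypothetical geometric-context score).
    """
    # argbest[c][s]: best state of child c given that its parent is in state s.
    argbest: Dict[int, List[int]] = {}

    def subtree_scores(p: int) -> List[float]:
        # Best achievable score of the subtree rooted at p, per state of p.
        scores = list(unary[p])
        for c in children.get(p, []):
            child_scores = subtree_scores(c)
            table = pairwise[(p, c)]
            argbest[c] = []
            for s in range(len(scores)):
                cand = [table[s][t] + child_scores[t]
                        for t in range(len(child_scores))]
                best_t = max(range(len(cand)), key=cand.__getitem__)
                argbest[c].append(best_t)
                scores[s] += cand[best_t]
        return scores

    # Upward pass: collect messages from the leaves to the root.
    root_scores = subtree_scores(root)
    best_root = max(range(len(root_scores)), key=root_scores.__getitem__)

    # Downward pass: read off each child's best state given its parent's.
    assignment = {root: best_root}
    stack = [root]
    while stack:
        p = stack.pop()
        for c in children.get(p, []):
            assignment[c] = argbest[c][assignment[p]]
            stack.append(c)
    return root_scores[best_root], assignment


# Toy example: part 0 (say, a torso) with two children, two states per part.
children = {0: [1, 2]}
unary = {0: [1.0, 0.0], 1: [0.0, 2.0], 2: [0.5, 0.0]}
pairwise = {
    (0, 1): [[0.0, -3.0], [0.0, 0.0]],
    (0, 2): [[0.0, 0.0], [0.0, 0.0]],
}
score, assignment = tree_max_sum(children, 0, unary, pairwise)
```

With K states per part, each edge costs O(K^2) work, so the whole tree is solved exactly in time linear in the number of parts rather than exponential, which is the general reason tree-structured models admit efficient inference.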


Similar resources

Investigating common visual symbols in photography of the Islamic Revolution based on Pearce's pattern

The Islamic Revolution, which has remained very influential on the way people live, changed the path of art in Iran, and especially of photography. There are many photographs of the Revolution. By examining their visual cues, the changes in the cultural and artistic trends of that period can be better understood. Are there any shared signs between the images in the photograph and do they share certai...


Learning to parse images of articulated bodies

We consider the machine vision task of pose estimation from static images, specifically for the case of articulated objects. This problem is hard because of the large number of degrees of freedom to be estimated. Following an established line of research, pose estimation is framed as inference in a probabilistic model. In our experience, however, the success of many approaches often lies in the po...


Virtual to Real Reinforcement Learning for Autonomous Driving

Reinforcement learning is considered a promising direction for driving policy learning. However, training an autonomous driving vehicle with reinforcement learning in a real environment involves unaffordable trial and error. It is more desirable to first train in a virtual environment and then transfer to the real environment. In this paper, we propose a novel realistic translation network to m...


Discriminative Hierarchical Part-Based Models for Human Parsing and Action Recognition

We consider the problem of parsing human poses and recognizing their actions in static images with part-based models. Most previous work in part-based models only considers rigid parts (e.g., torso, head, half limbs) guided by human anatomy. We argue that this representation of parts is not necessarily appropriate. In this paper, we introduce hierarchical poselets—a new representation for model...


Timescales for Sparseness of Natural Sound: Implications for Auditory-Symbols Processing

The statistical structure of the natural visual environment influences the strategies for sensory processing in the retina and the thalamus of primates, where second-order redundancy is reduced, and in the primary visual areas of cortex, where filters with high output-kurtosis, which expose the “sparseness” of the visual environment, are believed to facilitate scene parsing and object segmentat...




Publication date: 2013